Physical Representation Learning and Parameter Identification from Video Using Differentiable Physics

نویسندگان

چکیده

Abstract Representation learning for video is increasingly gaining attention in the field of computer vision. For instance, prediction models enable activity and scene forecasting or vision-based planning control. In this article, we investigate combination differentiable physics spatial transformers a deep action conditional representation network. By our model learns physically interpretable latent can identify physical parameters. We propose supervised self-supervised methods architecture. experiments, consider simulated scenarios with pushing, sliding colliding objects, which also analyze observability properties. demonstrate that network learn to encode images properties like mass friction from videos sequences. evaluate accuracy training methods, ability method predict future frames input actions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Video Representation Learning Using Discriminative Pooling

Popular deep models for action recognition in videos generate independent predictions for short clips, which are then pooled heuristically to assign an action label to the full video segment. As not all frames may characterize the underlying action—indeed, many are common across multiple actions—pooling schemes that impose equal importance on all frames might be unfavorable. In an attempt to ta...

متن کامل

Learning Compact Appearance Representation for Video-based Person Re-Identification

This paper presents a novel approach for video-based person re-identification using multiple Convolutional Neural Networks (CNNs). Unlike previous work, we intend to extract a compact yet discriminative appearance representation from several frames rather than the whole sequence. Specifically, given a video, the representative frames are selected based on the walking profile of consecutive fram...

متن کامل

A Differentiable Physics Engine for Deep Learning in Robotics

One of the most important fields in robotics is the optimization of controllers. Currently, robots are often treated as a black box in this optimization process, which is the reason why derivative-free optimization methods such as evolutionary algorithms or reinforcement learning are omnipresent. When gradient-based methods are used, models are kept small or rely on finite difference approximat...

متن کامل

Learning Physics with Video Analysis

Inspired by the pioneering work in photographic studies of motion and motion picture projection by Eadweard Muybridge in 1878 and by the high speed films of Harold Edgerton by the middle of the 20th century, the use of video has rapidly emerged nowadays as a powerful tool to teach physics at schools and universities, capturing what the human eye could not distinguish. To highlight the advantage...

متن کامل

the relationship between using language learning strategies, learners’ optimism, educational status, duration of learning and demotivation

with the growth of more humanistic approaches towards teaching foreign languages, more emphasis has been put on learners’ feelings, emotions and individual differences. one of the issues in teaching and learning english as a foreign language is demotivation. the purpose of this study was to investigate the relationship between the components of language learning strategies, optimism, duration o...

15 صفحه اول

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Computer Vision

سال: 2021

ISSN: ['0920-5691', '1573-1405']

DOI: https://doi.org/10.1007/s11263-021-01493-5